Speaker recognition using PCA-based feature transformation
نویسندگان
چکیده
منابع مشابه
Acoustic feature transformation using UBM-based LDA for speaker recognition
In state-of-the-art speaker recognition system, universal background model (UBM) plays a role of acoustic space division. Each Gaussian mixture of trained UBM represents one distinct acoustic region. The posterior probabilities of features belonging to each region are further used as core components of Baum-Welch statistics. Therefore, the quality of estimated Baum-Welch statistics depends high...
متن کاملUsing MAP estimation of feature transformation for speaker recognition
We propose to use a new feature transformation (FT) function to construct supervectors of support vector machines for speaker recognition. Considering that estimation of bias vectors is more robust than that of transformation matrices, we define the FT function in a flexible form that transformation matrices and bias vectors are controlled by separate regression classes. Unlike the MLLR-based a...
متن کاملSpeaker adaptation in transformation space using two-dimensional PCA
This paper describes a principled application of twodimensional principal component analysis (2DPCA) to the decomposition of transformation matrices of maximum likelihood linear regression (MLLR) and its application to speaker adaptation using the bases derived from the analysis. Our previous work applied 2DPCA to speaker-dependent (SD) models to obtain the bases for state space. In this work, ...
متن کاملSpeaker state recognition using an HMM-based feature extraction method
In this article we present an efficient approach to modeling the acoustic features for the tasks of recognizing various paralinguistic henomena. Instead of the standard scheme of adapting the Universal Background Model (UBM), represented by the Gaussian ixture Model (GMM), normally used to model the frame-level acoustic features, we propose to represent the UBM by building monophone-based Hidde...
متن کاملSpeaker recognition based on feature space trace
This paper presents a multiple templates matching algorithm based on feature space trace, which is used in speaker recognition. It extracts the cepstrum coefficient as feature parameter. We normalize the sequence of feature parameter based on feature space trace. The fuzzy c-means method is adopted in generating the multiple templates and the multiple matching method is applied to match the tem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Speech Communication
سال: 2019
ISSN: 0167-6393
DOI: 10.1016/j.specom.2019.04.001